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Abstract — In this paper, we will use the construction tech- 
nique proposed in [1] to construct multidimensional trellis coded 
modulation (TCM) codes for both the additive white Gaussian 
noise (AWGN) and the fading channels. Analytical performance 
bounds and simulation results show that these codes perform very 
well and achieve significant coding gains over uncoded reference 
modulation systems. In addition, the proposed technique can 
be used to construct codes which have a performance/decoding 
complexity advantage over the codes listed in literature. 

Index Terms — AWGN channel, fading channel, multidimen- 
sional MPSK TCM codes. 


I. Introduction 

A S WAS POINTED out in [1], for modulation codes 
over the additive white Gaussian noise (AWGN) chan- 
nel, the main parameter of interest is the minimum squared 
Euclidean distance between the transmitted code sequences 
and the number of nearest neighbors. Details on the above 
parameters are available in [2] and [3], and as such, we will not 
reiterate these design considerations here. The aforementioned 
design considerations will be the basis of construction of the 
modulation codes for the AWGN channel in this paper. 

If the channel is changed to a fading channel, most codes 
designed for the AWGN channel no longer perform well, sim- 
ply because the design parameters of a modulation code which 
need to be optimized for the fading channel are different from 
that for the AWGN channel. For the fading channel, we shall 
consider two scenarios. For the first case, we shall consider the 
Rayleigh-fading channel with slow fading, coherent detection, 
no channel state information, independent symbol fading and 
minimum squared Euclidean distance as the decoding metric. 
These assumptions have been considered so as to enable us to 
compare our codes with the ones listed in literature. Examples 
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3 and 4 construct codes for this scenario. For the second case, 
we consider the MSAT channel with light shadowing. Example 
5 constructs a code for this case. 

We would like to add that the code construction technique is 
universal and is by no means restricted by the aforementioned 
assumptions. For the fading channels in general, the error per- 
formance of a code primarily depends on its minimum symbol 
distance, the minimum product distance and path multiplicity. 
It depends on the minimum squared Euclidean distance to 
a lesser extent. Detailed discussion on these parameters of 
interest is given in [4] and [5], As such, we will not reiterate 
these design considerations here. The dominant parameter of 
interest is, however, the minimum symbol distance, and as 
such we will concentrate on optimizing this parameter, when 
we construct codes for the fading channel. 

This paper is organized as follows: In Section II of this 
paper, we will derive general analytical bounds on the perfor- 
mance of the modulation codes using the multistage decoding 
techniques proposed in Part I of this paper. In Section III, 
we will construct examples using the proposed technique and 
compare them with the codes listed in literature. 

II. PERFORMANCE ANALYSIS 

In this section, we will derive a general expression for 
the bit-error probability (BEP) of the multidimensional trellis 
coded modulation (TCM) codes decoded using the multistage 
technique proposed in [1, Section V]. 

For 1 < i < q, let be a random variable, where the value 
of X t denotes the number of bit errors at the zth decoding stage 
at a particular time instant t. Hence, 0 < X{ < k t . Then, the 
BEP of the multidimensional TCM code, denoted ft(c), is 

P 6 (e)=E^A,j/|> 

/ q 

= (E(-Yi) + E(X 2 ) + • • • + E(X,)) /Y, k ' 

(2.1) 

where E( ) denotes the expectation operator. For 2 < i < q, 
E(Xi) can be broken up into two terms, the first one being the 
expected number of errors at the rth stage assuming that the 
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previous i ~ 1 stages of decoding are correct and the second 
one being the expected number of errors at the ith stage due to 
erroneous decoding at either one of the previous i — 1 stages 
of decoding, i.e., the error propagation term. Hence, 

E(X t ) < (E(Xi)\ 

error propagation * PE t ) 4“ E(A t ) (ith stage error 

( 2 . 2 ) 

where E(Ai)| error propagation denotes the error propagation 
term, p£\ denotes the probability of error propagation from 
the previous stages and E(X I )| it h stage error denotes the term 
due to erroneous decoding at the z-stage, assuming that the 
previous i - 1 stages of decoding are correct. Hence, (2.1) can 
be rewritten in the following form: 

Pb(e) < ^ ^ (E( X j ) jerror propagation * PE r ) 

\z=2 

+ y^(E(^t)ltthstage error ) J j ^ 2 ^. ( 2 . 3 ) 

i= 1 /' i = 1 

Except for a few specific cases, it is not possible to obtain 
a general expression for the expected number of bits in error 
due to error propagation. The expected number of bits in error 
due to error propagation depend on both the choice of the 
inner codes as well as the outer codes, as will be shown in 
the examples to be discussed later in this paper. As such, we 
will therefore derive a general expression for the rest of the 
terms in (2.3). 

Let V be the transmitted code sequence. Using [1, eq. 
(3.12)] V can be written in the form A(^i(v!) + <f> 2 (v 2 ) + 

K<£ g (v q )), where vj for 1 < i < q denotes a code sequence 

in the convolutional code at the zth stage, Ci. 

For 1 < i < (q — 1), let us consider the term 

E(*0U 

stage error* Recall from [1, Section V] that at the 
zth stage of decoding, we form the trellis A (Ci), where a code 

sequence in A(C;) is of the form A(^ 1 (v 1 ) + <^ 2 (v 2 ) + h 

4- &(uj) + u;*), where Ui is a code sequence 
in the convolutional code at the zth level, C*, is a 
sequence of points from Qi, and for 1 < J < (i - 1 ), vj 
denotes the estimate of vj . Since we are considering the term 
E(Xi)|i t h stage error? Vj = Vj for 1 < j < (i — 1). Also, since 
C x is a linear code, the code sequence Ui can be written in 
the form iij = vj 4* e, where e is code sequence in C*. As 
such, any code sequence in A (Ci) can be rewritten in the form 
V = A(0i(vi)+^ 2 (v2) + -*-+^i. 1 (vi_ 1 )+^(v i +e)+Wi). 
Say, that the decoder at the zth stage of decoding decodes 
the code sequence associated with the convolutional code to 
be vi + e, and let the probability that the event occurs be 
Pe- The exact expressions for p e can be found in [2] and 
[3] for the AWGN channel and in [4] for the Rayleigh-fading 
channel. Let I e denote the number of nonzero information bits 
associated with the sequence e. Then the expected number of 
bits in error (per decoding time instant) due to the sequence 
e is / e * p e . Since e is any arbitrary code sequence in the 
convolutional code C t , the total number of bits in error at 
the ith stage, E(A*)|t t h stage error is obtained by considering 
all the possible code sequences and adding up all the I e • p e 


terms, i.e.. 


E(*)l 


ith stage error 


< } ] * Pe 

eeCi 


(2.4) 


where Ci denotes the set of all the code sequences in the 
convolutional code, 

Special Case — AWGN Channel: For the results derived 
above, let us consider the special case when the channel 
is AWGN. Let V be the transmitted code sequence and 
let V be the decoded code sequence. Both these sequences 
have the form as given earlier. Let denote the minimum 
squared Euclidean distance between V and V. Since vj for 
1 < j < (2 — 1 ) is arbitrary, D\ has been taken to be 
the minimum over all possible transmitted code sequences 
for a fixed e. This is the worst case scenario, and as such 
the minimum squared Euclidean distance Dl gives us an 
upper bound on the performance of the code. Also, let 
N e be the number of codewords at a squared Euclidean 
distance of Dl from V. The probability that V is decoded 
incorrectly depends upon both D\ as well as N e [3]. The 
code sequences v\ and e can be written in the general form 
vi = (Vi,i, ■ * ■)> and e = (e x , e 2 , . . . , e p , * • •)? 
where and e p for 1 < p < oc denotes the output sequence 
(rii bits) of Vi and e, respectively, at the pth time instant. 
The minimum squared Euclidean distance between V and 
V at the pth time instant depends only on e p and let this 
squared Euclidean distance be denoted by D\ . Also, let N Cp 
be the corresponding number of nearest neighbors [3]. Then, 
D l = ££Li D l v and N e = n~i N e„- Dl and N e can be 
evaluated using the technique proposed in [3]. 

E(X 9 ) 1 9 th stage error depends on whether the gth level 
of encoding uses a convolutional code or is left uncoded. 
If a convolutional code is used at the qth level, then 
the expressions for E(A^)| ?t h stage error are the same as 
those derived above. However, if the 9 th level is left 
uncoded then E(X 9 )| 9t h stage error can be upper bounded 
as E(A 9 )| 9th 

stage error ^ BER^ * k q , where BER 9 denotes 
the decoding error probability (i.e., the block error probability) 
for the last stage of decoding, i.e., the block of k q bits at the 
qth stage of decoding would be declared to be in error if at 
least one of the bits is in error. The block error probability 
would depend on the decoding algorithm used at the qth stage, 
i.e., single-stage or multistage. The block error probability can 
be calculated using results of [ 6 ]. 

A very interesting and special case of the results derived 
above occurs when q = 2 and the second level outer code is 
left uncoded, as shown in [1, Fig. 2]. For this special case, we 
can get a closed-form expression for P&(e). Using (2.1) and 
( 2 . 2 ), Pb(e) can be written in the form: 


Pb(e) < at E(X 0 |i 


stage error 


\\.=i 

^^(^2) jerror propagation^ ' PEo ^ j (^-1 + ^2 ) 


(2-5) 


E(A"i) | ui stage error can be derived using (2.4). 

E(^2)|2nd stage error can be upper bounded as 
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Fig. 1. An 8PSK signal constellation and its signal labels. 

E(A r 2 )| 2nd stage error < BER 2 ■ k 2 . Let V be the transmitted 
code sequence. Then, using [1, eq. (3.12)], V can be written 
in the form A(<pi( v i) + ^ 2 ), where vx is a code sequence 
in the convolutional code used at the first level, C\ and 
<jl >2 is a sequence of points from fl 2 . Let the decoded 
code sequence associated with the convolutional code be 
vi + e, where e is a code sequence in C\. p e gives us 
the corresponding probability of this event. Let tufc(e) 
denote the branch weight of e. Hence, the error sequence 
e will cause at most u^(e) blocks of k 2 bits at the second 
stage to be in error, i.e., the number of bits in error at the 
second stage of decoding, due to the error sequence e is 
< k 2 • w*,(e). Using arguments similar to those used to derive 
(2.4), (E(A 2 ) [error propagation • Pe 2 ) can be upper bounded 
as(E(A 2 )| err or propagation ’ PE? ) — ^>e£Ci ^2 * Po- 

III. Examples 

Examples 1 and 2 construct codes for the AWGN channel. 
Examples 3 and 4 construct codes for the Rayleigh-fading 
channel and Example 5 constructs a code for the light shad- 
owed mobile satellite communication (MSAT) channel. In the 
following, we will use (n, k, d) to denote a linear block code 
of length n, dimension k and minimum distance d. 

Example 1: Consider the case of m = 8, q = 2 and 
choose S = &PSK. Hence l = 3. Fig. 1 shows the two- 
dimensional (2-D) 8PSK signal constellation of unit energy, 
in which each signal point is uniquely labeled with three 
bits, abc , where a is the first labeling bit and c is the last 
labeling bit. The labeling is done through signal partitioning 
process [2]. Choose Co,i = (8,4,4) Reed-Muller (RM) code, 
Co, 2 = C u 2 = (8,7,2) code C 0 ,3 = C x% 3 = (8,8,1) code, 
and Ci,i = (8,1,8) code. The minimum squared Euclidean 
distance of Ao = A(flo) is 2.344 and for Ai = A(fix) is 
4.0 [6]. The encoder structure will be the same as that in 
[1, Fig. 2]. A rate-2/3 code will be used at the first level. 
Two choices will be considered for the convolutional code at 
the first level. The first choice is the four-state, ds . free = 2 
code from [1, Table II] and the second choice is the 16-state, 
ds- free = 3 code from [1, Table II]. The phase invariance 
of the resulting code is the same for both the choices and is 


45° and can be derived by a straightforward application of [1, 
Theorem 7]. The spectral efficiency is also the same for both 
the choices and is equal to (16 + 2)/8 = 2.25 bits/symbol. The 
mapping <f>\ used is linear. Details of <p\ have been omitted due 
to lack of space. The following gives a detailed discussion for 
both the choices. 

Four State: The minimum squared Euclidean distance of 
the code is (refer to [1, Theorem 5]): min{4. 0,2.344-2} = 4.0. 
Using [1, eq. (3.12)], any code sequence in the super trellis 
can be written in the form A(<£x( v i) + where v x is 
code sequence in the 2/3-rate convolutional code used at 
the first level and wi is a sequence of points from fix. As 
such, the super trellis for this code is isomorphic to the 
trellis of the convolutional encoder used at the first level, 
with each branch of the trellis consisting of 2 16 parallel 
transitions corresponding to the 2 16 elements of fix- fix has 
a four-state, eight-section trellis diagram [6]. Each branch of 
the super trellis can be expressed in the form A(a;o 4- fli), 
where u>o € [flo/^i]- Hence, each branch of the super trellis 
has a four-state, eight-section trellis, which is isomorphic to 
the trellis of f 1\. Standard Viterbi decoding can be used on 
every branch of super trellis using this four-state, eight-section 
isomorphic trellis to find the most probable parallel transition. 
The trellis of the overall multidimensional code can thus be 
viewed as a nested trellis diagram , i.e., a trellis within a trellis. 

A reduction in the decoding complexity can be achieved 
by using the multistage decoding algorithm proposed in [1, 
Section V]. The decoding now proceeds in two stages. Let 
V be the transmitted code sequence. Using [1, eq. (3.12)], V 
can be written in the form A(^>x(vx) + ujJ r ), where vx is a 
code sequence in the convolutional code C\ used at the first 
level, and u^ r is a sequence of points from fix. At the first 
stage of decoding, we form the trellis C^ up , where any code 
sequence in Cl up can be written in the form 0x( u i) + <-^ up , 
where u ^ up denotes a sequence of points from £l\ up and ux 
is a code sequence in C\. The details of how the trellis Ci Up 
is formed were mentioned in [1, Section V]. flj Up is chosen 
to be: Q\ up = (8,1,8) * (8,8,1) * (8.8,1) which has a very 
simple two-state trellis structure. On the other hand, has a 
four-state eight-section trellis diagram which is more complex 
than the trellis structure of fl* up . This helps in reducing the 
closest coset decoding complexity associated with the first 
stage of decoding. Standard Viterbi decoding is performed on 
the received sequence using the trellis A(Cj Up ) to obtain an 
estimate of v x , denoted v x . This completes the first stage of 
decoding. 

At the second stage of decoding, we construct the trellis C 2 , 
where a code sequence in C 2 is of the form <f>\ (v x )+u?x, where 
u?x denotes a sequence of points from fix. Consider the pth 
time instant. The structure of C 2 at the pth time instant is of 
the form C 2 , p = ^i(vi, p ) + Slu where Vi, p is the component 
of Vj at the pth time instant. This trellis C 2iP is isomorphic to 
the trellis Tii and this trellis can be used to obtain an estimate 
of u>\ T , where u)\ T p is the term in u>\ r corresponding to the 
pth time instant. 

The decoding complexity associated with the second stage 
of decoding can be further reduced by using the three-stage 
decoding technique for fix proposed by Sayegh [7] and Tanner 
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[8]. We will carry out the second stage of decoding using the 
three-stage decoding technique mentioned above. 1 * * 

The multistage decoding algorithm does lead to a slight 
degradation in performance, however, as will be shown in 
the performance curves, the loss is negligible as compared to 
the reduction in complexity. The following gives the number 
of computations associated with both the optimal and the 
multistage decoding algorithm for the four- state trellis. The 
complexity calculation for the multistage decoding algorithm 
have been carried out assuming the three-stage decoding for 
the second stage, as mentioned above. 

Computation Complexity — Optimal Decoding Algorithm: 
7i = 2 and k\ = 2 : The branch decoding complexity Bc x is: 
1) since there are eight 8PSK points per branch, the distance 
computation complexity per branch is 64; 2) the survivor 
calculation for the parallel branch transitions in tti requires 
32 compares; and 3) the Viterbi decoding for Qi requires 
52 adds and 27 comparison to calculate the final survivor 
(assuming the survivor for the parallel transitions has been 
found). Since there are eight cosets, the total complexity is 416 
adds and 216 compares, i.e., Bq 1 =416 adds + 248 compares 
+ 64 distance computations. Hence, total complexity is 54 
adds 4- 32.5 compares -f 8 distance computations per two 
dimensions. 

Computation Complexity — Multistage Decoding Algorithm: 
7i = 2 and k\ — 2 \ The branch decoding complexity is: 
First stage of decoding: 1) There are eight 8PSK points 
per branch, hence, the distance computation complexity per 
branch is 64; 2) the suboptimal distance estimates [8] require 
48 compares; 3) Viterbi decoding of requires 14 adds and 
one compare. Since there are eight cosets, the total complexity 
is 1 12 adds and eight compares. 

Second stage of decoding: 1 ) The multistage decoding 
technique requires 26 adds and 13 compares; hence, total 
complexity is 19.25 adds + 10.125 compares 4- 8 distance 
computations per two dimensions. 

Fig. 2 shows the simulation results of the bit-error per- 
formance of both the optimal and the multistage decoding 
algorithm. An upper bound on the bit-error rate (BER) of the 
proposed code is also shown in Fig. 2. Details of the bound 
have been omitted due to lack of space. Also shown in the 
figure is the bit-error performance of a hypothetical uncoded 
phase shift keying (PSK) system of the same spectral efficiency 
[ 10 ]. 

Fig. 2 shows that the multistage and optimal decoding 
curves converge around Eb/N 0 = 8 dB, and the performance 
of the optimal curve is only slightly better at low signal-to- 
noise ratio (SNR). The proposed code achieves a coding gain 
of 2.8 dB at the decoded BER of 10~ 6 over the uncoded refer- 
ence system of the same spectral efficiency [10]. In addition, 
the decoding complexity of the optimal decoding algorithm 
is roughly about three times the decoding complexity of the 
suboptimal one. 

Pietrobon et al [3] do not have a comparable code over 8 x 
2 dimensions, hence, comparison will be made with a 4 x 2- 

1 Note, the first stage of the three-stage decoding process for Hi can actually 

be combined with the first stage of decoding of the TCM code, i.e., the stage 

which uses the trellis C* up . 


dimensional code over 8PSK with 7 = 2 and phase invariance 
= 45°. Spectral efficiency of this code is 2.25 bits/symbol, 
same as that of the proposed code. The performance curve 
of this code, taken from [12], has also been shown in the 
figure. The complexity of the Pietrobon code is 24 adds 4- 
17 compares + 8 distance distance computations per two 
dimensions. As can be seen from the figure, the proposed code 
outperforms the Pietrobon code by roughly 0.4 dB at 4 10~ 6 
BER, and in addition, the complexity of the proposed code 
with multistage decoding is less than that of the Pietrobon 
code. 

16 States: The minimum squared Euclidean distance of the 
code is (refer to [1, Theorem 5]) min{4.0, 2.344 • 3} = 4.0. 
The super-trellis in this case is very similar to the four-state 
trellis discussed above, with the only difference that the four- 
state convolutional code at the first level, has been replaced 
by the 16-state trellis. Both the optimal and the multistage 
decoding techniques will be investigated for this case also. 
The complexity associated with the optimal and the multistage 
decoding technique are as follows. 

Computation Complexity — Optimal Decoding Algorithm: 
7i = 4 and ki = 2 : The branch decoding complexity Bq x is 
the same as the four-state case. Therefore, total complexity is 
60 adds 4- 37 compares 4- 8 computations per two dimensions. 

Computation-Complexity — Multistage Decoding Algorithm: 
71 = 4 and k\ = 2 : The branch decoding complexity is the 
same as the four-state case. Therefore, the total complexity is 
25.25 adds + 14.625 compares 4 8 distance computations per 
two dimensions. 

Fig. 3 shows the bit-error performance of the both the 
optimal and the suboptimal-decoding algorithm. An upper 
bound on the BER of the proposed code using the multistage 
decoding algorithm is also shown in Fig. 3. 

Fig. 3 shows that the multistage and the optimal decoding 
curves exhibit the same characteristics as the four-state case. 
The two curves converge around E^/No = 6.54 dB, and the 
performance of the optimal curve is only slightly better than 
the optimal curve at low SNR. The proposed code achieves 
a coding gain of 3.2 dB at the decoded bit-error-rate of 
10“ 6 over the uncoded reference system of the same spectral 
efficiency [10]. In addition, the decoding complexity of the 
optimal decoding algorithm is roughly about 2.5 times the 
decoding complexity of the multistage one. 

Pietrobon et al [3] do not have a comparable code over 
8 x 2-dimensions, hence, comparison will be made with a 
4 x 2-dimensional code over 8PSK with 7 = 3 and phase 
invariance = 45°. The spectral efficiency of this code is 2.25 
bits/symbol, i.e., it is the same as that of the proposed code. 
The performance curve of this code, taken from [9], has also 
been shown in the figure. The complexity of this code is 48 
adds 4- 32 compares 4- 8 distance distance computations per 
two dimensions. The performance of the proposed code is 
slightly better than the Pietrobon code and in addition the 
complexity of the Pietrobon code is about two times higher 
than that of the proposed code with multistage decoding. 

The 16-state proposed code with the multistage decoding 
algorithm achieves better performance than the four-state 
proposed code with the multistage decoding algorithm at the 
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Fig. 3. BEP of the code in Example 1 with a 16-state encoder at the first level for the AWGN channel. 


cost of slightly increased decoding complexity. The improve- 
ment in performance is due to the higher minimum squared 
Euclidean distance of the first decoding stage of the 16-state 
code. This leads to better performance at the first decoding 
stage and as a result reduced error propagation onto the second 
decoding stage. 

Example 2: Consider the case of m = 16, q = 3 and 
choose S = 8PSK. Hence, £ = 3. Choose Co,i = (16,4,8) 
code. This code is obtained from the first-order RM code of 


length 16, by removing the all ones vector from the generator 
matrix of the (16,5) code. Choose Co. 2 = (16,11,4) RM 
code, Co ,2 = Co , 3 = Ci , 2 — Ci ,3 — C 2.3 = (16, 16,2) code 
and C u = C 2a = (16,0,oc) code, i.e., the code consisting 
of just the all zero codeword. The minimum squared Euclidean 
distance for Ao = A(fio) is 4.0, for Ai = A(fii) is 4.0 and 
for A 2 = A(H 2 ) is 8.0 [6]. A rate-3/4 code with 64-states 
(second code in [1, Table III]) will be used at the first level. 
Let us call this code Ci. The same rate-3/4 code used at 




182 


IEEE TRANSACTIONS ON COMMUNICATIONS. VOL. 45, NO. 2. FEBRUARY 1997 


the first level will be used at the second level. Let us call 
this code C 2 . The phase invariance of the resulting code is 
90°. The spectral efficiency is equal to (3 + 3 + 26)/16 = 2 
bits/symbol. The mappings <f>i and <f >2 used at the first and 
second encoding levels respectively have been chosen to be 
linear. The minimum squared Euclidean distance of the code is 
at least (refer to [1, Theorem 5]), min{8.0, 3-4.0, 3-4.0} = 8.0. 
Note that the theorem gives the minimum squared Euclidean 
distance associated with the first encoding stage to be at 
least 12.0. A quick verification of the partitions given above 
show that the minimum squared Euclidean distance is actually 
3 x 8 x 0.586 = 14.064. This is obtained by considering the 
squared Euclidean distance due to the (16, 4) code of f* 0 and 
multiplying it by the free branch distance of C\. 

Optimal decoding of the multidimensional code would 
require a trellis with 2 6 * 2 6 = 2 12 states. Optimal decoding 
of the code using this 4096 state trellis would be extremely 
complex, and as such we will focus on the multistage decoding 
technique proposed in [1, Section V]. The multistage decoding 
of the multidimensional code proceeds in three stages. 

Let V be the transmitted code sequence. Using [1, eq. 
(3.12)], V can be expressed in the form A(<fo(vi) + ^>2(v 2 ) + 
1^2)* where vi is a code sequence in the 64-state convolutional 
code Ci , v 2 is a code sequence in the 64-state convolutional 
code C 2 and u? 2 is a sequence of {joints from f) 2 - 

First stage of decoding: To simplify the trellis decoding 
complexity associated with the first stage of decoding, instead 
of forming the trellis C\ we form the trellis C^ up , where any 
code sequence in Ci up can be written in the form (refer to [1, 
Section V]), <j>i (u x )+o^ up , where u>\ up is a sequence of points 
from Q, s x up and uj is a code sequence in C\. f^ up is chosen 
to be, fij up = (16, 0, 00) * (16, 16, 1) * (16, 16, 1). has 
a very simple one-state trellis structure. On the other hand. 
Cl 1 has a four-state trellis diagram which is more complex 
than the trellis structure of Q,\ up . This helps in reducing the 
closest coset decoding complexity associated with the first 
stage of decoding. Standard Viterbi decoding is performed 
on the received sequence using the trellis Cj Up to obtain an 
estimate of v a , denoted v\. This completes the first stage of 
decoding. 

Second stage of decoding: To simplify the trellis decod- 
ing complexity associated with the second stage of decoding, 
instead of forming the trellis C 2 , we form C£ up , where any 
code sequence in C 2 up can be written in the form (refer to [1, 
Section V]), ^(v^+^i^J+u;^, where is a sequence 
of points from tiff 9 and u 2 is a code sequence in C 2 . 
is chosen to be: ft 2 up = (16,0, 00) * (16, 11,4) * (16, 16, 1). 
ft 2 up has a eight-state trellis structure [1 1]. On the other hand, 
ft 2 has a 16-state trellis diagram which is more complex 
than the trellis structure of ft 2 up . This helps in reducing the 
closest coset decoding complexity associated with the second 
stage of decoding. Standard Viterbi decoding is performed 
on the received sequence using the trellis C 2 up to obtain an 
estimate of v 2 , denoted v 2 . This completes the second stage 
of decoding. 

Third stage of decoding: The third stage of decoding is 
identical to the second stage of decoding discussed in Example 
1. The three stage decoding technique proposed by Sayegh [7] 


and Tanner [8] is used to split up the decoding of 0 2 into 
three stages. The first stage decoding of ft 2 is trivial. Note, 
the second stage of the three-stage decoding process for ft 2 
can be combined with the second stage of decoding of the 
multidimensional TCM code. 

Computation Complexity — Multistage Decoding Algorithm: 
7i — 6, ki = 3, 72 = 6, fc 2 = 3: The branch decoding 
complexity is: 

First stage of decoding: 1) The distance computation 
complexity per branch is 128; 2) the suboptimal distance 
estimates require 96 compares; 3) Viterbi decoding of 0^ up 
requires three adds. Since there are 16 cosets, the total 
complexity is 48 adds. 

Second stage of decoding: 1) The closest coset decoding 
for ft s 2 up requires 184 adds + 87 compares, which is the trellis 
decoding complexity of the (16, 11, 4) code [11]. Since there 
are 16 cosets, the total complexity is 2944 adds and 1392 
compares. 

Third stage of decoding: 1) The multistage decoding 
technique for ft 2 requires 58 adds and 29 compares. Note 
that only the decoding complexity of the (16.15,2) code 
has been taken into account. The decoding complexity of the 
(16, 11,4) code is included in the second stage of decoding 
for reasons mentioned above. Hence, the total complexity is 
254.62 adds + 150.81 compares + 8 distance computations 
per two dimensions. 

Fig. 4 shows the simulation results of the bit-error per- 
formance of multidimensional TCM code. As can be seen 
from the figure, the code achieves a 4.2 dB coding gain over 
uncoded quaternary phase shift keying (QPSK) at 10“ 6 BER. 
An upper bound on the BER of the proposed code using the 
multistage decoding algorithm is also shown in Fig. 4. 

Pietrobon et al [3] do not have a comparable code over 
16 x 2 dimensions, hence comparison will be made with a 
2 x 2-dimensional code over 8PSK with 7 = 7 and phase 
invariance = 90°. The spectral efficiency and phase invariance 
of both codes is the same. This Pietrobon et al code is the 
best in performance among all the codes listed in [3] for rate 2 
bits/symbol. The performance curve of this code, taken from 
[9], has also been shown in the figure. The complexity of 
the Pietrobon code is about two times higher than that of the 
proposed code, however, the proposed code has performance 
comparable to the Pietrobon code at high SNR. 

Example 3: Consider the case of m = 2, q = 3 and choose 
S — 8PSK. Hence, £ = 3. Choose C0.1 = Co, 2 — Co, 3 = 
Ci, 2 = Ci,3 = (2,2,1) code, C 2 ,3 = (2,1,2) code and 
C 24 — C 2 , 2 = C11 = (2,0, 00) code. The minimum symbol 
distance of Ao = A(fto) is 1, for Ai = A(fti) is 1 and for 
A 2 = A(ft 2 ) is 2 (refer to [1, Section HI]). The other distance 
parameters associated with the three block modulation codes 
can be found by a straightforward application of the distance 
theorem in [5]. A rate- 1/2 code with 16- states (fourth code in 
[1, Table I]) will be used at the first level. Let us call this 
code Ci. A rate-2/3 code with 16-states (second code in [1, 
Table II]) will be used at the second level. Let us call this code 
C 2 . The phase invariance of the resulting code is 180°. The 
spectral efficiency is equal to (1 + 2+1) /2 = 2 bits/symbol. 
The mappings <j>\ and 0 2 have been chosen to be linear. 
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Fig. 4 . BEP of the code in Example 2 for the AWGN channel. 


The minimum symbol distance of the code is (refer to [1, 
Theorem 6]), min{2, 3 1,5-1} = 2. Since the minimum sym- 
bol distance of the overall modulation code is the minimum 
symbol distance of A 2 , hence the minimum product distance, 
A 2 of the modulation code is (4.0) 2 = 16.0 (refer to [5]). 

The decoding of this code is carried out in three stages and 
proceeds exactly as discussed in [1, Section V]. The second 
and third stage of decoding can actually be combined into 
one single stage of decoding. The computational complexity 
calculated below assumes that the second and third decoding 
stages have been combined. 

The minimum symbol distance of the first stage is chosen to 
be higher than the rest of the decoding stages, so as to reduce 
the effect of error propagation. 

Computation Complexity — Multistage Decoding Algorithm: 
7i — 4. A'i = I.72 = 4, k 2 = 2 : The branch decoding 
complexity is: 

First stage of decoding: 1) The distance computation 
complexity per branch is 16; 2) the suboptimal distance 
estimates require 12 compares; and 3) Viterbi decoding of 
requires 1 add. Since there are four cosets, the total 
complexity is 4 adds. 

Second and third stage of decoding: 1) Viterbi decoding 
of n 2 is 2 adds -I- 1 compares. Since there are 8 cosets, 
the total complexity is 16 adds and 8 compares. Therefore, 
the total complexity is 58 adds + 42 compares 4- 8 distance 
computations per two dimensions. 

Fig. 5 shows the simulation results of the bit-error perfor- 
mance of the proposed code. The performance of this code 
will be compared with the 16-state rate-2/3 code over 8PSK 
constructed by Schlegel and Costello [13] for the Rayleigh- 
fading channel. The spectral efficiency for both codes is 
the same, however, the Schlegel-Costello code has no phase 
invariance. The performance curve of the Schlegel-Costello 
code is also shown in Fig. 5. As can be seen from the figure, 
the proposed code outperforms the Schlegel-Costello code by 
about 1.6 dB at 10“ 4 bit error rate. In addition, the complexity 


of the Schlegel-Costello code is 64 adds -f 48 compares + 
8 distance computations per two dimensions which is slightly 
higher than that of the proposed code. 

Example 4: Consider the case of m = 8, q = 4 and choose 
S = 8PSK. Hence, t = 3. Choose C 0 ,i = C 12 = C 3 , 2 = 
C3 3 = (8,4,4) RM code, Co, 2 = CV2 = (8,7.2) code, 
C 0l 3 = Ci , 3 = C 2 , 3 = (8,8, 1) code and C iA = C 2A = 
C 3 1 = (8,0, oc) code. A rate-3/4 code with eight-states (first 
code in [1, Table III]) will be used at the first level. Let us call 
this code C\. A rate-2/3 code with 16-states (second code in 
[1, Table II]) will be used at the second level. Let us call this 
code C 2 - A rate-3/4 code with 64-states (second code in [L 
Table III]) will be used at the third level. Let us call this code 
C 3 . The phase invariance of the resulting code is 180°. The 
spectral efficiency is equal to(3+2+3+8)/8 = 2 bits/symbol. 
The mappings 0 i, 02>$3 and <^4 are chosen to be linear. 

The decoding of this code is carried out in four stages and 
proceeds in a manner similar to that in Example 2. The first 
stage of decoding is similar to the first stage of decoding in 
Example 2. used to simplify the decoding complexity 
is : n* up = (8,0, 00) * (8,8.1) * (8,8.1). has a very 

simple one-state trellis which is less complex than the two- 
state trellis of fii. The second and third stage of decoding is 
carried out exactly as described in [1, Section V]. The fourth 
stage of decoding is carried out using the multistage decoding 
technique for f2 3 (as was explained in Example 1). The 
multistage decoding of f 2 3 proceeds in two stages. The first 
stage of decoding decodes the code C 3i2 ami the second stage 
decodes the C 3(3 code. The decoding of C 3 , 2 can be merged 
with the second stage of decoding of the proposed code, and 
the decoding of 6/3,3 can be merged with the third stage 
decoding of the proposed code. The complexity calculations 
given below assume that the fourth stage of decoding of the 
proposed code has been merged with the previous stages. 

Computation Complexity — Multistage Decoding Algorithm: 
71 = 3 , k x = 3, 72 = 4 , k 2 = 2, 73 = 6, k 3 = 3 : The branch 
decoding complexity is: 
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Et/No 


Fig. 5. BEP of the code in Example 3 for the Rayleigh-fading channel. 



Eb/N 0 


Fig. 6. BEP of the code in Example 4 for the Rayleigh-fading channel. 


First stage of decoding: 1) The distance computation 
complexity per branch is 64; 2) the suboptimal distance 
estimates require 48 compares; 3) Viterbi decoding of f^ up 
requires 7 adds. Since there are 16 cosets, the total complexity 
is 112 adds. 

Second stage of decoding and the first stage of the fourth 
stage of decoding: 1) The closest coset decoding complexity 
is 36 adds and 11 compares, which is the trellis decoding 
complexity of the (8, 4, 4) code [11]. Since there are 8 cosets, 
the total complexity is 288 adds and 88 compares. 

Third stage of decoding and the second stage of the fourth 
stage of decoding: 1) The closest coset decoding complexity 


is 36 adds and 11 compares, which is the trellis decoding 
complexity of the (8, 4, 4) code [11]. Since there are 16 cosets, 
the total complexity is 576 adds and 176 compares. Therefore, 
total complexity is 202 adds +108 compares + 8 distance 
computations per two dimensions. 

Fig. 6 shows the simulation results of the bit-error perfor- 
mance of the proposed code. The performance of this code 
will be compared with the 64-state rate-2/3 code over 8PSK 
constructed by Schlegel and Costello [13] for the Rayleigh- 
fading channel. The spectral efficiency for both codes is 
the same, however the Schlegel-Costello code has no phase 
invariance. The performance curve of the Schlegel-Costello 
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Fig. 7. BEP of the code in Example 5 for ashadowed mobile satellite channel. 


code is also shown in Fig. 6. As can be seen from the figure, 
the proposed code outperforms the Schlegel-Costello code by 
about 1.5 dB at 2 • 10 -4 BER. In addition, the complexity of 
the Schlegel-Costello code is 256 adds -I- 192 compares + 
8 distance computations per two dimensions which is higher 
than that of the proposed code. 

Example 5: A statistical model for the shadowed mobile 
satellite channel has been devised by Loo [ 14]— [17] and 
this model has been used by other researchers [ 1 8]— [23] to 
study the error performance of coded modulation schemes 
over the MSAT channel. In Loo's model, there are three 
different kinds of shadowing-light, average and heavy. The 
corresponding Rician factors are 6.16, 5.46, and —19.33 dB, 
respectively. Therefore, in the shadowed MSAT channel, a 
coded modulation system suffers very severe distortion due to 
randomly changing phase and multipath fading. Especially, 
if the Doppler frequency shift is large due to the motion 
of vehicle, a coded modulation system faces the error floor 
phenomenon. We will assume that the carrier frequency is 
870 MHz and the symbol rate is 2400 symbols/s. Due to 
randomly changing phase, perfect phase synchronization is not 
feasible in the shadowed MSAT channel. Therefore, differen- 
tially detected 8PSK modulation is used. We assume that the 
speed of moving object is 92.88 miles/h. The corresponding 
normalized fading bandwidth BT is 0.05, where B is the 
maximum Doppler frequency shift and T~ l is the symbol 
rate. To combat burst errors, a block interleaver is used for 
computer simulation. The size of interleaver is 512 8DPSK 
symbols, and the number of rows of the block interleaver is 
64 and the number of columns is 8. 

Consider the case of m = 8 and q — 3. Hence, l = 3. 
Choose Co,i = C 2,2 = (8,4,4) RM code, Co , 2 — Co t 3 = 
Ci f 2 — C*i,3 — C*2,3 — ( 8 , 7,2) code and — C2,i — 
(8, 0, oc) code. A rate-3/4 code with eight-states (first code 
in [1, Table III]) will be used at the first level. Let us call 
this code C\. A rate-2/3 code with 16-states (second code in 
[1, Table II]) will be used at the second level. Let us call 


this code C 2 . The phase invariance of the resulting code is 
90°. The spectral efficiency is equal to (3 4* 2 + 1 1)/8 = 2 
bits/symbol. The mappings <j>\ and <f >2 used at the first and 
second encoding levels are linear. 

Decoding of the code proceeds exactly as in Example 2, and 
as such, will not be repeated here. The complexity calculations 
are also very similar to Example 2, and as such details will be 
omitted. The total complexity is 69.25 adds 4- 31.63 compares 
+ 8 distance computations per two dimensions. 

Fig. 7 shows the simulation results of the bit-error per- 
formance of the proposed code. The performance of this 
code will be compared with the 16-state rate-2/3 code con- 
structed by Schlegel and Costello [13] (this code is chosen, 
for lack of comparable complexity code available in literature 
for the shadowed MSAT channel). The spectral efficiency 
for both codes is the same. The performance curve of the 
Schlegel-Costello code is also shown in Fig. 7. As can be 
seen from the figure, the proposed code outperforms the 
Schlegel-Costello code by about 9.65 dB at 10 -4 bit error rate. 
Also, the proposed code faces the error floor at around 1 .4 x 
10~ 5 BER, whereas the Schlegel-Costello code faces an error 
floor around 4.8 x 10” 5 BER. In addition, the complexity of 
the Schlegel-Costello code is higher than that of the proposed 
code. 

IV. Conclusion 

A simple and systematic technique of constructing multi- 
dimensional TCM codes using block modulation codes and 
convolutional codes optimized for branch distance is proposed. 
Bounds on the minimum squared Euclidean distance and 
minimum symbol distance of the multidimensional TCM codes 
are derived, along with conditions on phase invariance. A 
multistage decoding technique for the multidimensional TCM 
codes has also been proposed. Examples constructed show that 
the technique can be used to construct good codes which have 
a performance/decoding complexity advantage over the codes 
available in literature for both the AWGN and fading channels. 
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